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SUMMARY 


^  , .  .  .  ,  n  k  be  k  given  populations. 


Dist 


Pi 


Let  it 

Assume  that  we  wish  to  find  a  population  be! ITT  ttrarr  <T 
given  control,  if  there  is  any.  from  all  populations  we  may  draw  inde¬ 
pendent  samples  with  distributions  which  are  {at  least  partly)  determin¬ 
ed  by  real  parameters  say.  A  population  r,  .  is  viewed  to  be 


better  than  the  control  if  <j.  >  ctQ,  i  =  l,...,k,  where  n  (  IK  is  a  fixed 


given  constant.  The  goal  is  to  guarantee  at  least  a  probability  P*  of 
making  a  correct  decision  if  <>.  <  n^,  i-1,...,k,  and  to  maximize  the  prob 


ability  of  finding  a  population  better  than  ,  otherwise. 


Two-stage  procedures  of  the  following  type  will  be  studied:  At 
Stage  1,  based  on  samples  X^,...,X'k,  all  populations  are  screened  out 


which  appear  to  be  no  better  than  n  If  none  (exactly  one)  is  left  the 


procedure  stops  and  decides  that  none  (this  one)  is  better  than 


1  f 


more  than  one,  with  i  es, survives  then  one  proceeds  to  Stage  Here 


additional  samples  Y.,  its,  are  dr. 


drawn  and  final  decision  is  made  based 


on 


or  (X .  ,Y  ),  i  (  s. 


-l-l 

A  natural  class  of  two-stage  procedures  is  proposed  which  can  re 


completely  described  and  studied  in  terms  of  Neyman-Pearson  testing  the¬ 
ory,  where  the  urisymmetry  of  tests,  however,  can  be  overcome  to  a  consid¬ 
erable  extent.  As  a  typical  result  it  is  shown  that  optimality  o1  tests 
carries  over  to  optimality  of  two-stage  procedures,  finally,  under  nor¬ 
mality,  comparisons  are  made  in  case  of  k?  with  certain  Bayesian  pro¬ 
cedures  . 


/ 


r 


I.  Introduction 

If  k  populations  are  given  and  we  wish  to  decide  on  the 

basis  of  a  properly  chosen  sampling  scheme  which  one  ol  these  popula¬ 
tions  is  the  best  one  (e.g.  has  the  largest  mean),  various  different  ap¬ 
proaches  and  methods  have  been  studied  up  to  now.  A  complete  overview 
is  provided  by  Gupta  and  Panchapakesan  (1979).  Among  those,  two-stage 
procedures  with  screening  in  the  first  stage  seem  to  be  juite  appropri¬ 
ate,  since  they  are  more  economical  as  one-stage  procedures  but  still 
technically  not  as  complicated  as  sequential  ones.  Nevertheless,  opti¬ 
mal  ity  results  here  are  missing  up  to  now  and  obviously  are  hard  to  find. 
Even  the  implementation  of  a  simple  procedure  (as  that  one  which  uses 
Gupta's  (1965)  maximum  means  procedure  in  the  first  stage  and  the  nat¬ 
ural  final  decision  in  the  second  stage)  in  an  indifference  /one  approacn 
under  the  assumption  of  normality  with  a  common  known  vat  iunce  causes 
considerable  difficulties.  For  details  and  references  see  Tamhanr-  ami 
Bechhofer  (1979),  Gupta  and  Miescke  (1979;  and  Mioscko  and  Sehr  (1980). 

The  situation  becomes  somewhat  fairer  if  we  wish  to  find  a  popula¬ 
tion  better  than  a  control  :i  ,  whether  it  is  known  or  unknown.  This  be¬ 
cause  pairwise  comparisons  are  to  be  made  now  Pe tween  ■■  ■  and  •  inrtoad 

of  ii.  and  n  j  ,  ijtj ,  i,  j  C  l,...,k  let  us  admit  here  additionally  the 
choice  of  a  final  decision  "none  of  the  i"  ,  lat  ions  m  bet  ter  (.nan  the 
control".  Moreover,  let  us  adopt  the  following  basic  ■  •  ri : re/x  n ;  -mo 
goa  1 : 

(’♦-Condition:  Let  P*  f  { U ,  1 )  be  a  predetermined  constant .  proc  dure 

is  said  to  meet  the  I**- condi t ion  if  its  probabi I i ty  ol  mating  the 

final  decision:  "none  of  'he  populations  is  bet  t.er  :  me  the  cos  ft-i.r 


is  at.  least.  P*  whenever  this  decision  is  Mined. 


Goal.:  Among  all  procedures  (ir  a  given  class)  which  meet  the  P*-condit ion 

find  that  procedure  which  maximizes  the  probability  of  deciding  final¬ 
ly  in  favor  of  a  population  better  than  the  control  if  there  is  any. 
The  purpose  of  this  paper  is  to  show  that  a  natural  class  of  two-stage 
procedures,  being  widely  used  in  practice,  can  be  described  and  studied 
within  the  framework  of  Neyman-Pearson  testing  theory,  where  the  unsymme- 
try  of  tests  can  be  overcome  to  a  considerable  extent.  Emphasis  hereby 
is  laid  on  the  basic  structure  and  on  comparisons  of  such  procedures  rather 
than  on  establishing  specified  ones. 

In  Section  2  we  introduce  a  natural  class  i,’  of  two-stage  procedures 
and  derive  a  formula  for  their  probabilities  of  correct  decisions.  As  a 
typical  consequence  it  will  be  demonstrated  in  Section  3  that  two-stage 
procedures  based  on  good  unbiased  one-sample  tests  for  H.  :  "population 
?! ^  is  better  than  vQ"  versus  :  "population  ^  is  inferior  with  respect 
to  -i  ",  which  are  simultaneously  good  tests  for  the  dual  problem  (where 
H.j  and  K..  are  interchanged),  perform  well.  Three  open  questions  conclude 
this  section.  Finally,  in  Section  4  we  study  the  normal  case  and  make 
some  comparisons  with  certain  Bayesian  procedures  in  the  case  of  k  2 
populations. 


2.  Basic  results 


Suppose  that  for  every  i.,  i'l,..  ,k,  we  have  a  family 
^ f i  0  •  '  0 .  C  '  IR  ^ensi ties  with  respect,  to  the  lebesgue  measure 

or  any  counting  measure  on  IR  which  have  a  common  vg-port  ()•  D'  and  may 
be  known  or  onlv  partly  known.  The  assumption  concerning  the  supports 
is  made  for  convenience  to  make  ideas  clearer  and  can  be  weakened  in  cer- 

■  .  '-ay  , 


tain  circumstances.  Let  the  fixed  known  control  be  denoted  by 


and  all  populations  n.  be  called  better  than  the  control  if  ■■  •  ■■  and 

inferior  to  it  if  ik  <  0  .  Let  r  (X^  .  ,X.  )  and  Y .  -  ( Y  .,,...  ,Y  •  ) , 

i  i 

i=l,...,k,  be  samples  from  n ^  available  at  Stage  1  and  Stage  2,  respective¬ 
ly,  where  X-j  ,Y^ .  ,X^,Y^  are  mutually  independent,  and  let  X-  (Xj,...,X^) 
and  Y  =  ( Y -j , . . .  ,Yk) . 

Before  we  are  going  to  define  a  natural  class  of  two-stacje  procedure', 
in  a  concise  way,  let  us  briefly  describe  how  these  procedures  typically 
are  applied  in  practice.  For  every  testing  problem  k. :  ■  •  versus 

Fb :  a.  '•  0  the  experimenter  chooses  a  test  based  on  X-  and  for  h-  ver¬ 
sus  another  one  based  on  Y.  or  (X.,Y.),  i  l,...,k,  takes  two  levels 
'i,  ,r<2  (  (0,1 )  (which  usually  are  small)  and  proceeds  as  follows: 

At  Stage  1  he  discards  all  populations  which  are  not  significant  at 
level  under  the  first  sec  of  tests.  If  none  (exactly  one)  is  left,  he 
decides  that  none  (this  one)  is  better  than  the  control.  Only  if  more 
than  one  population  survives,  he  proceeds  to  Stage  ?. 

At  Stage  2  the  experimenter  draws  additional  samples  Y.  from  those 
11.  's  which  were  selected  at  Stage  1  and  exchanges  hypotheses  and  alterna¬ 
tives  with  respect  to  these  populations.  If  all  these  populations  now 
turn  out  to  be  significant  at  level  under  the  second  set  nt  tests 
(which  is  rather  unlikely  to  happen':  »,,•  de.  ides  tha!  none  of  the  popula¬ 
tions  is  better  than  the  control.  Otherwise,  hr  makes  a  tied!  u,  r 
in  favor  of  that  population  among  the  selected  ones  whies  has  the  lutes; 
p- value  under  the  associated  second  tet. 

if  these  tests  are  upper  level  ,  ^  (respectively  lower  level  .  i 
which  for  simplicity  are  non  raedo: .  t /ed  tor  a  ■  loment.  t  •  1  f  i ,  ideas,  hu  u  ; 
on  real  valued  statistic-  Ij  and  v  .  .  t  t  hen  the  proi  edum 


4 


described  above  can  be  egui valently  described  as  follows:  At  Stage  I  all 
ti^'s  are  selected  with  U •  >  c.  (where  c-  is  the  uj-fractile  of  IF  under 
o.  -  i‘o),  and  final  decision  is  made  in  terms  of  the  largest  V.  among  the 
selected  n.'s,  provided  V.  d.  (where  d.  is  the  x?-quantile  of  V  under 
=  oQ).  The  truncated  version  of  such  procedures  (i.e.  which  perform 
Stage  1  only)  have  been  studied  by  several  authors.  See,  for  example, 

Gupta  and  Sobel  (1958)  and  Lehmann  (1961).  Also  some  work  has  been  done 
in  sequential  setups.  For  references  see  Gupta  and  Panchapakesan  (19/9), 
Chapter  20.  But,  apparently,  no  results  concerning  two-stage  procedures 
of  the  type  described  above  have  appeared  in  literature  until  now.  This 
gives  us  the  motivation  for  the  following  considerations. 

To  begin  with,  let  us  state  without  a  formal  proof  that  by  similar- 
arguments  as  are  used  in  Miescke  (1979a)  it  can  be  shown  that  every  pro¬ 
cedure  of  the  type  described  just  now  -  where  U.,  c.  and  V ■  ,  d .  more  gen¬ 
erally  may  take  values  in  measurable  spaces  Xj  and  y  .  ,  and  where  lb  and 
V1  are  stochastically  non-decreasinq  in  with  respect  to  measurable  total 
orderings  in  X-  and  .  ,  i  =  l,...,k  -  is  a  member  of  the  class  i.  to  be  de¬ 
fined  below. 

To  avoid  confusions  and  to  arrive  at  a  consistent  representation  of 
this  class  let  us  from  now  on  use  only  tests  for  IF  versus  K. ,  i  l,...,k, 
which  take  value  1  as  soon  as  one  observation  falls  outside  support  (J. 

(This  modifies  procedures  only  on  null-sets.)  finally,  several  definitions 
given  in  Miescke  (1979a)  will  be  relevant  in  the  sequel  but  for  brevity 
are  not  repeated  here.  Especially,  tests  may  be  randomized  ones  taking 
values  in  [0,1].  This  typically  occurs  in  discrete  cases  or  in  continuous 
cases  when  nonparametric  (rank)  tests  are  under  concern.  Thus  signi t i< am  e 


') 


statements  as  well  as  p-values  are  understood  to  be  based  on  additional 
randomization  schemes  as  are  used  in  Miescke  (1979a).  To  be  more  specific 
let  A  =  (A.|  , . . .  ,A^)  be  that  one  for  the  first  stage  and  B  =  (B^ , . .  . 
that  one  for  the  second  stage.  Note  that  X,  Y,  A  and  B  are  assumed  to  be 
independent. 

The  class  a  of  two-stage  procedures  : 

For  i=l ,. . . ,k  let 

(1)  c.  =  { s  .  )  r,  be  a  right-cont  i  nuous  and  monotone  (in  <) 

1  l ,  u  ,tt  Lo,  i  J 

unbiased  test  for  H.  versus  K-  based  on  X. ,  which  is  standardiz¬ 
ed  at  o  .  Assume  that  within  Q  i  0  and  s  ^  1.  Let 


(2)  Analogously,  let  y-  =  I  (  j-(J  ^  be  such  a  test  for  i|.  ver¬ 

sus  K-  based  on  (X- ,Y • ) .  Let  .  -  (.  -j . 

For  0  <  ,  1 1  •  1  and  0  -  ^  •  1  let  ( g ,  1  ^ )  denote  tne  following  two- 

stage  procedure: 

Stage  I:  Select  ^  if  p,  (X.,A.),  the  p-value  of  X.  under  ...  is  larger 

s  i 

than  I  -  1 1 ,  i-l,...,k.  If  none  (exactly  one)  of  the  populations  is 
selected,  stop  and  decide  "none  (this  one)  is  better  than  ■•  ".  (<t  he*' 

wise  proceed  to  Stage  2. 

Stage  2:  Among  the  selected  populations  csui  i  de  finally  in  favor  d  (naf 
n  j  which  has  the  largest  p-value  p  ( X ^ , V .  , I '■  -  )  an  h  r  .  ^ .  prnv  u.e  :  > 1 

is  no  smaller  than  ....  Otherwise  decide  that  “none  is  he*  ter  t  nan 

2.  o 

let  A.  be  tne  set  of  all  sucn  two-stage  procedures. 

The  following  result  will  serve  as  our  basic  tool  to  determine 
(  tj  tuples  for  meeting  the  P*-condi  t  ion  as  well  as  to  compare  tne  inn 

formance  of  competing  procedures  s.itisiyin:  tne  l"*-c  -mdi  t  ton 


Theorem  1 .  Le_t  (<,  .  Tor  notational  convenience  let 

=  E.  f,  ■  ^  (X.)  and  F.(-t)  =  E  (X^  is  not  significant  under 

^  ^  ^  k 

S  •  i  1,  i  =  l,...,k.  Then  for  every  non-empty  Dc  il,...,k(  and  •< 

*  »  *  ”U-j 

(2.1)  P.  {final  decision  of  (  s ,  1  -  ij  tails  intoD; 


=  /  n  [E  +(1-E  )!  -  (  0  jd  (  II  [E.  +  d-E  )!  („)] 

_■  A  n  J  .  J  J  ra  1  1 


J  A  A  ^  t  '  I  '  I  '  '  J  \  A  A  i-  } 

<i2  j  f  D  J  J  J  i  c  D 


wAe1e_  integration  is  with  respect  to  i.  Moreover, 

(2.2)  P  {final  decision  of  (v  J--‘j  ^  does  not  tall  into  k  } 


n  C E . + ( 1  - e , ) r , ( ,,)]  . 


vl  J  .1 


Proof:  It  is  shown  in  Miescke  (1979a)(cf.  (2.3)  there)  that  tire  distribu¬ 

tion  function  of  each  p-value  appearing  in  ( s  , 1  - )  equals  to  the 
power  function  of  the  corresponding  test,  which  hereby  is  thought  of  being 
a  function  of  i€[0,l]  where  parameter  «  €  ■;  k  on  the  other  hand  is  field 
f i xed. 

Let  D  <_  ( 1 , . . .  ,k } ,  D  f  0  ,  •_;€<;  k  ,  0  •  ■  1  and  0  ■  ip  1  be  1 1  ic¬ 

ed.  Then 

Pq{ final  decision  falls  into  O' 

=  V  P  (final  decision  is  in  favor  of  «  ! 
r  f  D  - 

=  F  y  P  tthe  n.'s  with  i  Cs  are  selected  and  final 
rCD  s  :r  €  s  - 

decision  is  made  in  favor  of  : 


i  /  .II  f  .  (  x)riKr(  0  [I  ( 1  -L j )  II  f 


r  (  n  s:rfs  its 
i  t  r 


I  I  II  U-E  )r  ( 0  ri  «:  (d({i-t r)i  ( 

r(D  a.  I  s:r(js  i(s  j  $  s  M 

I  t  •' 


Now,  the  integrand  !•••!  equalr.  to  II  [E (1  -E  -  )F  .(  ,)]  and  (  l-L  ;■ 

i  !  1  1  !  ' 

i  t  r 

can  be  replaced  by  E  +(l-t  )1 r(  ,).  Thus  P  final  decision  fall-  int"  ■' 


equals 


1  k 


v  /  II  [E.  +  (l-E  .  )r .  (  OJdLE  *■(  l-E  )i  (  •*) ] 

-  i  n  ,  - 1  1  11  1  r  i 


r  (  I  )  i  =  I 

i/r 


/  II  [e >( i-E . )r . ( t ) ]  ’>  II  > L . - ;  1  - f  ,)i  .;  . ) s u : i  - 

t?  J  ^  D  1  J  1  r  (  !.'  i  (.  U  11 

i  /  r 


I  rT 

-  f  II  [E.  +  (1-E.)r.( i } J d  (  II  [ E .  +  ( 1  - E . ) I  . (  i ) ]  !  . 

‘2  J  <i  P  J  '  1  HD  1  11 

This  completes  the  proof  of  (2.1).  Since  (2.2)  can  be  verifier!  by 
inq  similar  arguments  its  proof  is  omitted  for  brevity. 

Remark:  Note  that  for  i  - 1 , . . .  ,k  we  have1  a  I  so  the  following  representar  ion 

of  EnO-EjKjt  .),  ,  (  h\l  I : 


!»  L-  (0,i-  'hio.-,,  yyn. 

I  1  1 

p  ,  ( •:  p  (x.  ,A .  )•  1- 1,  n  ; 


!  I  -  ■ ,  and  p  ( *. .  ,V  . 


Corollary  1.  Every  two-stage  procedure  (.  .  i  . .  s  a  *.  i  s  f  i  e 

P*-cond i tion  if 


( I  - M  ) f  •’* 


(2.4) 


r 


Pron*-'-  \  iower  bound  for  {?..?}  is  II  L;  which  satis!  ies 


k  k 

II  Li  II 

j  -- 1  1  j  - 1 


II  L  :  ,  (X;)  •  II  i 

1-1  j  '1  ■'  -I  O 


k  o ' 


This  fol  lows  f ron  the  unbiasedness  id  the  t.<  ,t 


Unfortunate 1 v ,  the  dependencies  between 


.  ,  (  A  .  j  (Mel  l  ■  .  ,  t 

1,1-1,  -1  ’  I  ,  ,  -I  -  I 


il,...,k,  make  it  hard  to  find  good  procedures  ii 


i  heret ore  mu  i 


our  results  in  the  sequel  will  be  given  only  with  respect  to  s',  sav,  whet  e 
r..  '  c  *  consists  of  all  procedures  from  &  where  the  tests  ..  in  the  seiuud 


stage  depend  only  on  t he  Yj's  and  not  on  the  s,  i  1,. 


i  he  i nt  •  t  ■ 


ed  reader  is  invited  to  try  to  prove  one  of  th<»  conjecture:  stated  .if  *n 
end  of  this  section. 


Corollary  I'.  A  two-stage  procedure  (  ,  .1-,,  (  x  '  satis  t  ies  Ms 

-  -  -  -  '  ”  i  ~ 

P*-condit ion  if  and  only  if 


u-v  v.i’ 


Proof :  let 


1 -  k  '  o  ’ 


o'  i.  ,  i  -  I 


r  - 


,  t., )  (  {.'  ( i: .  t )  redui  es 


■  v,.))l  V  1  ! 

'j ■' 


which,  by  the  t.nbi  asedrto  s  of  the  test  and  •  no  tail  that  a'(i-a 
b*(l-h)a  :  s  i  no  eac- inq  in  t,  o  <  {  0 ,  i  j ,  a-- Mimes  i!s  lowest  value  at 


t  o 


,  where  the  newer  turn  ‘ion.  are  equal  to  the  ’ evi  I  nt 


the  tests. 


Remark:  It  in  a  procedure  (  ,  1  |  (  j\  every  pair  nt  ,  . 

have  non-ne  taf  ive  t  orrelat  ions  for  ,  i  I . 1  t'whiih,  nt  .  nut 

t  o 

is  uiveti  ’t  tht  pro(  isiu*'"  be  bui  is  In  j.  '  )  ,  I  nt"  I  he  t  >it  lm.ir:  ■  >'  ...  1  si: 


J 


k  fr 

■  ■!  falls  between  (1-^)  and  (l-^'.p.,)  . 

Let  us  from  now  on  adopt  the  following  convention: 

Convention:  All  procedures  from  i» '  are  assumed  to  have  .in  .j  satisfy- 

i ng  (!-g)  =  P*  and  a  small  i^. 

j. 

In  view  of  (2.5)  (1-g)  =  P*  and  i,,  -  0  clearly  is  the  test  choice  * 
minimize  the  expected  overall  sampling  amount  (and  to  make  (2.4)  to  an  ex¬ 
act  condition  with  respect  to  i  ).  But  on  the  other  hand  an  experimenter 
might  feel  restricted  at  not  being  permitted  to  decide  also  at  bta  ,e  l 
aqainst  all  populations.  Thus  let  us  admit  at  least  a  small  Tin's  is 

slightly  conservative  with  repsect  to  (2.5).  But  it  changes  the  probabil¬ 
ities  of  any  events  at  most  by  a  difference  of  (max  ■,  This  fol¬ 

lows  from  the  fact  that  ^  acts  only  on  probabilities  of  events  where  at 
least  two  populations  pass  Stage  1  and  eventually  are  rejected  at  Stage 
To  (jive  a  numerical  example,  take  .j  -•  ••  10  .  The'-  r  k  -  5(10;  we 

have  a  P*  above  0.95  (0.90)  and  to  cnangos  all  probabi i i ties  at  most  by 
-4 

the  amount  of  10  . 


3.  Consequences  and  extensions 

The  following  two  results  will  be  used  repeatedly  in  the  segue  1.  The 
proofs  are  straightforward  using  integrations  by  parts  and  are  mere fore 
omitted  for  brevity. 


Lemma  1.  Let  G.,  G.  :  [0,1]  >■  [0,1  J  be  right-continuous,  non-decrcv. '  ' ng 

with  G  j  ( 1 )  =  G .  ( 1 )  -  1  and  (  -  )G .  (  ( )  fot  i  1  ,....,•(  i  i  ■*  I . k ) , 

rt  I  1 , .  .  .  ,k  |.  Then  for  0  •  ,,,  1 


]  r  k 

)  II  G  ■  (  <  )<l(  II  G  (,)) 

«?  i  1  1  j  ml  ' 


II  G  (  -  )d(  II 
i  1  '  ,  r‘ I  ’ 


(1.1) 


r 


As  a  special  case  of  Lemma  1  we  pel 

Corollary  T.  Let  as  before  arid  H  -  (  i )  •  (1  (0.  <([0,1],  lot  a 

pair  i,  J  c  { I . k  r  wit'5  i  /  j.  Then  for  0  -  •  1 

1  k  lk 

(3.2)  ■  II  (  < )d'i  •  (  < )■  *■  f  II  l)  (  <  )rJ(j .  (  < )  . 

.  i  J  •  i 

*j  c  ■ t  i 

for  the  sepuel  let  P  C.it.  <  denote  the  probability  of  a  turret  t  dec  i  - 

sion  at  ■  (  '  ,  l.e.  that  the  final  decision  falls  into  R^j)  •:  i  ;  ■  .  , 

i  - 1 , . . .  .  k  if  R(  •  •  is  non-empty  or  that  the  final  decision  is  "no  popula¬ 
tion  is  belter  than  it  R(_)  is  empty. 

Carol  l  Ary  3.  '•  et  (  .1-  ^  ,  <-,)  C  J-  f  ■  j  '  1  and  U  •  •  1 .  If  for 

every  i  C  1  , . .  .  , k  ■  and  .  .  are  (IMP  unbiased  tests  for  versus  t..  based 

on  X..  and  Y.  and  if  s  imu  1  laneous  ly  1- s-  and  l-.;  are  UMP  unbiased  tests  fm 

the  dual  testing  problem  (where  tne  hypothesis  and  the  alternative  a  re  inter¬ 
changed) ,  tnen  at  every  •  t  ^  (s  ,1- ••],.,<•,)  has  the  largest  b  .<.!».•  and 
the  smallest  expected  sampling  amount  among  all  (  _  ,1- < ,)  t  •»'  1  • 

As  is  well  known,  tnese  condi'  .on'-  are  usual  lv  fulfilled  in  one-para¬ 
meter  MLk  and  mu  1 1 i parameter  exponential  family  situations.  The  proof  of 
Corollary  ?■  a  we  1 1  as  that  of  tne  next  result  follow,  from  1 )  ami 
Lemma  1 . 


Corollary  4.  Let  (,  ,  I-, (  j s'.  If  the  power  functions  of  all  tests 

are  ncn-decreu  ,in<j  (nor:  increasing)  in  .ample-si/es  for  .  ■  (  )  ,  then 

k 

P  \C.D.  1  is  nondecrea  ,  i nq  in  .ample  sires  <it  every  1  ‘ 


The  next  result  <  an  tie  stated  with  respect  to  t.  . 


Corollary  5.  Let  (s  ,  1  -  ,  v ,  ^ )  (  A  where  ,  consist1,  of  consistent  tc1 

Then  P  iC.D.t  converges  to  1  if  n.  -  ■  and  •.  /  i  1 . k. 


Proof:  Let  •)  C  with  o  f  i>q,  i  =  l,...,k.  Then 

P|(IC.I).  i  ■  P0lonly  h  . 1  s  with  i  (  il(-)  are  selected  at  Stage  1  .■ 


II  ,  (x.)  II  ' 

j  4R('')  j  J’  1  "J  i  c  R(  )  i 


■  ,  (>■•)) 

i  ,  1  - 1 ,  - 1  ‘ 


which  tends  to  1  for  large  n.,...,n  by  Che  consistency  of  the  te't.s. 

Under  the  assumption  of  monotone  (non-decreasi nq )  likeli'Mod  ratios 
(MLR)  a  stronger  result  can  be  obtained. 

Theorem  2.  Assume  that  in  every  population  .  the  family  of  densities  n,r- 

MLR,  i~l . k.  Let  ( \  ,  1  -  ^  )  f  k.  (or  a')  consist  ol  the  UMP  tests 

for  the  corresponding  testing  problems.  Then  for  increasing  ■•.ample  size1 
ri-,  in.,  i=l,...,k,  P  (final  decision  in  favor  of  that  with  the  largest 
"•  •  a  i  tends  to  1  for  all  1  C  :  k  with  R(  -)  f  (A  and  P  (final  decision  i 

k 

"no  population  is  better  than  "  ■  tend1,  to  1  for  all  ••  C  with 
"l . "k  ’  o  • 

Proof:  Let  n  •  >•  0  •  1  and  P  .  ■  I. 

k  o  1  1  - 1  1  / 

P  j  ( «»  ,l--»i  ,c>»  1-, )  finally  decides  i ..  \ivor-  >■<  ■  . 

<p  (:<k,Ak)  ;p  (a,  ,Yk.,;k)  .  •  .  ■  /  . 

k  k  i 

Now,  P  (X,,A,  )  •  1-t  -p  (X  ,Y.,B  }  i.,  t.en  j*>  t.n  i  tnr  n.  .n'O 

U  'i  k  *  \  -  r.  -  k  1  .  k 

m^  hy  the  consistency  of  the  tests. 

Moreover,  the  procedure  which  decides  ir  r  avoir  of  a  cops!, if  ion  .Kf.nd- 
inq  to  the  largest  p-value  with  respei.t  to  •  u,  t  <  viewed,  <  mu  ;  lei  '  I  ; , 
as  being  based  on  x,  say,  where  ,  '-.sei.t  i .  I  1  .  •  ‘a-  i:h  :  :  hat 


now  standardi  zod  ai  ( in  .tead  •»(  ■■  1,  i  1 . k.  [jut  then  since  all 

►  o  ■ 

populations  ip...,-.  ^  a -e  shitted  into  "alternatives", 

1  k- 1 

P.ip  ■>  p  J/k;  -  p  p  •  |i  ,i/k,  -  /  II  t,.  (X.,V.)d'i  , 

-  -k  ‘i  -  t  ■=  Oil  j  1-' 

(cf.  Miescke  (  i  3  /  9  .i )  )  wk  i  or.  tends  to  1  tor  lame  Mj , . . .  ,i:i^  .  When  cm  re 
than  one  ■  is  lamest  a  similar  insult  can  bo  derived.  The  p  *oof  for  t. 1 

using  UMP  tests  ^  based  t;t:  . is  exactly  the  same.  (he 

second  assertion  of  the  tneorem  is  already  proved  by  Corrollary  b. 

Remark:  If  the  a-.ymi'tot  i  c  relative  et  fieienty  (ART)  in  the  sense  o;  Cit'iia 

is  defined  in  terr..  of  tin  proeal.i  1  ify  of  selecting  the  i  - '  s  with  i  c  (•!(_•) 

at  Staqe  I,  then  1 1  AM  i  •.  ■ ,  ,  .  )  dot's  not  depend  on  i  c  :1 . k ,•  (as  is 

typical  ly  the  case  when  s  j . •  .  are  of  the  same  type  and  .  j . • 

are  of  the  same  i  ype )  we  hove  i  •  ■  -j )  AKL  ( ( ,  1  -  ^  ,  v , s  i 1  -  >  - 1-< 

for  al  l  t)  .  '|  •  I,  U  ...  i  ..ad  .  .  Ihe  proof  is  similar  to  that  in 
Miescke  (1979a).  t't  i  nurv  .  it  would  h»  'io>  e  satisfactory  to  have  an  AKI  - 
concept  including  both  . '  a  ,  .  m.-t  tins  s.-ei”  .  to  tie  a  ditiicult  prob  I  ein. 

In  fart,  bitmap's  .ipprouui  doe-.  tio*  b  a  '  nere  t.o  clear  com  1  us  ions. 

Corollary  <>.  1  r- 1  t  .  .  •  -  . ,  .  ....  *  .  it  tne  oower  functions  of  .  and 

Vj  are  non-  i  nc  roes  mu  i  n  .  <  .  i  '  , .  .  .  k  ,  tfun.  for  •  ,  s  <  I  ,  .  .  .  ,  t  with 

r  s 

( 3.  3 )  ■  f  i  na  1  dei.  s  i  ■  -n  ■  i  ■,  '  aver  o'  1 

r 

!'  '  :  i, a  1  dei  so.  i  .a;  i  .  in  |  a  vnr  ot 

Proof:  ihi  ,  follows  t  *  on  !  »  .c  .  'O'ollurv 


f  i  nal  Remarks : 


(1)  The  results  so  far  derived  hold  analogously  in  cases  where  the 
control  values  uQ  may  depend  on  i  ( (l,...,k). 

(2)  The  case  of  unknown  controls  can  be  treated  analogously  providou 
that  control  samples  are  drawn  independently  for  each  single  test. 

(3)  Under  the  assumption  of  MLk  let  (s  , 1  - , ^ , , , « ^ }  <  s,  consist  of  the 
(JMP  tests  for  the  corresponding  testing  problems.  Ihen 

(3.4)  inf  IP  { C .  D .  J|u  (uk  ,  R(a)  -  inf  i  P  tC.D.  ,i  ■  (  k(  ,  /  0 


[>  i  I-.  . 

v  o’  o’””  o' 

This  follows  from  Theorem  1  in  a  recently  published  paper  by  Simons  J9fio) 
(2.1)  and  Lemma  1 . 

(4)  Let  us  conclude  this  section  by  stating  the  following  three  im¬ 
portant  questions  that  have  not  been  settled  now:  Assume  that  in  all  popu 
lations  MLR  is  given  and  that  S  .  are  the  LIMP  test,  based  on  •.. ,  > 

(x  i  ,Yi ) ,  i--l . k. 

1)  Is  ( ‘i »  I*  u  ,y,  >p)  performing  better  than  (>.  i  i 

II)  [f  this  is  true,  how  well  performs  (',,!•  ,)  in  „  . 

Ill)  The  one-stage  procedures  (  ,  I-  ,,  )  and  (.,1-^)  w  hub  .el  «'•  r  am  or. 
in<i  to  the  largest  p-values  of  the  corresponding  tests.  provided  t-,at  t  ne . 
are  larger  than  I  -  ij ,  are  tne  natural  competitor,  to  ;  ,  ; 

(  i  ,1  Do  they  need  larger  sampling  amount  s  tier  •  tie  tv..  • .  >  a  ■ :« 

I  - 

pror  eilures  f.jke  in  the  near  to  have  thi  same  P  . 


4.  I  he  normal  ui'.is  Bayesian  two  stage  prucedm  a.  for  1 

Assume  trial  we  have  l>  me  i.i  1  ..opula''oie  ,-i  t  »•  ui  »  e>>w.  e,e, 


in. !  mown  va  r  i  am  e 


I  1  ,*  J  I  li  •:  h  i f  *  ■  Mi*  .  t 
1 


When  the  variances  are  unknown  t.he  optimal  procedure  in  *, '  is  based 
on  t-tests  in  an  analogous  way.  Let  s.  and  s •  denote  the  usual  1  iHV  unbi¬ 
ased  estimators  of  o.  based  on  and  Y^,  respectively,  i-l,...,k.  Then 
(l-'ij)  at  Stage  I  has  to  be  replaced  by  n  >s  - 1  ( n;  - 1  , 1  ) ,  and 

'"r*"r1'  (a)  at  Stage  2  has  to  be  replaced  by  m^- s  t(m  - 1  ,  . ) ,  where  t(n,.) 
denotes  the  t-quantile  of  the  t-distribution  with  n  degrees  of  freedom. 

Thouqh  a  (Bayesian)  decision  theoretic  approach  is  quite  difficult  to 
perform  in  general,  the  case  of  kr  2  populations  can  at  least  be  studies 
to  some  extent.  A  two-staqe  procedure  will  now  be  described  by  S(X)(the 
random  subset  of  (1,21  of  indices  of  those  populations  being  selected 
at  Stage  1)  and  d ( X , Y )  (the  final  decision  at  stage  ?.).  As  before,  the  pro¬ 
cedure  stops  and  decides  0,  i.e.  "none  is  better  than  •  "  (l,.:‘/  it 
S(X)  sA  (  I  I  1,121)  and  d ( X ,  V )  at  Stage  2  is  used  only  if  S(X)  f 1 . 

Let  >’ :  P  ►  1R  ,  with  •  ( 0 )  -  0,  tie  a  non-decreasing  function  which  acts 
as  loss-qain- function  with  respect  to  final  decisions  1  and  Assume 
that  decision  0  leads  neither  to  a  loss  nor  a  gain.  Moreover,  let.  c  u 
be  the  costs  we  have  to  pay  if  we  wise  lo  pet  form  Stage  2.  lieally,  let 
1  be  the  prior  distribution  of  the  (now  rarido- :)  parameter  ve*.  f  or  .  Ihe-s 
the  overall  Bayesian  risl  is  qi  m,  bv 

(4.1)  /  jlc  ♦  V  -  if  !'!  _.•  .  .  ;  j;  ;•  •:  I..  } 

til 

7 

+  .  i  -  m>  js ;  >  ■  j  (  i 

j-i  '  0  ’*  ' 

Ibe  optimal  decision  d*  at  Stair  when  •:  '  r  i  / « •  ■ .  to<  >•,,•  ,, . 

pec  ted  loss  given  X  and  Y  ■  )  does  t.  it  i« -t .< > r ■  1  •■u  (nr  ,;.ei  la  1  <  urn  r  •• 
any  subset  selection  rule  S  arid  turn  out  ■  * ,  be 


1') 


(4.2)  d*( =  i  iff  Ef (l'0*  i  )  I  X  -  ,  y=r,| 

<  minjy,Ei.  (' o-o.)  |X-^,  Y  =  ni},  !  i (1,2|  , 
and  d*(f,n)  0  otherwise. 

The  optimal  subset  selection  rule  S*  at  Stage  1  (which  minimizes  t  tin 
posterior  expected  loss  given  X  -  under  the  assumption  that  d*  will  he 
used  at  Stage  2)  decide'  according  to  the  smallest  of  the  four  values  give 
in  the  following  scheme: 

(4.3)  S*(fJ  -  0:  0 

S*(;,)  Mi;  FM  •0-oi);,j-  m.  Ml.?  . 

S*(f.)  =  ;  1 ,2  , :  c+LyninJo,  min  f  ■  ■  (■  —  '  . )  ■  X=  ,Y  t }  j  X”/.  j  . 

Notethat  in  tne  last  expression  the  inner  conditional  ?xpect.  iat.  ion  is 
viewed  as  being  a  function  ot  Y,  and  that  the  outer  one  is  the  expecta¬ 
tion  with  respect  to  the  conditional  distribution  of  Y-given  X 

Now  let  us  assume  the  following  normal  model:  < onditional ly,  given 
o  =  o,  X  and  Y  are  independent  witli  X  *  ( •  ■ , p  1  )  and  Y  '•  (•-,<( I  ) ,  and 

apriori  -  ?.  (*>02 ,  rl ) ,  p,q,r  •  n.  I  (,]]’).  ]’  (U). 

Then  by  using  for  convenience  IJ,  V,.  V.,,  which  are  assumed  to  he  in¬ 
dependent  standard  normals,  we  get  the  following  scheme  equivalent  tn  (4.  < 

(4.4)  S*(')  0  :  0 

S*()  "  i  ' :  F  ’’{ -  (i  )  -  .  )+ (Vp(par)  1  )•  U  )  )  ,  i  1,2 

\  U  I  ' 

t  *  '  i 

S*(  )  ;  c+l  1  ‘‘ 1  min  j'J.min 

11,2 


where  ,  -  pr[ ( p  +  r ) (pg+pr+or ) )  •  and 


i  epg/ (iig*pr*-gr  )  ]• 


r 
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Let  especially  v.  be  linear,  i.e.  v(a)  a.'.,  ;  C  IK,  where  we  can  as¬ 
sume  without  loss  of  generality  that  a^  1  holds  (since  this  can  be  com¬ 
pensated  by  c).  Moreover,  let  us  for  a  moment  restrict  our  considerations 
to  two-stage  procedures  which  at  Stage  2  are  not  permitted  to  make  deci¬ 
sion  0.  (This  corresponds  to  procedures  in  c.1  or  i  with  ^  -  0.)  Tnen 
the  optimal  procedure,  denoted  by  d*  and  S*,  can  be  described  in  a  concise 
form. 

(4.5)  d*(f,,n)  =  i  iff  qt-  +  pirn  q ;  f  pn-,  n  , j  t  -1,2 

'  "  1  i  j  J 

and  S*  decides  according  to  the  smallest  of  the  4  values  given  in  the  fol¬ 
lowing  scheme: 

(4.6)  S*(0  =0:0 

S*(f,)  =  lil:  ) *  i  =  l  .2, 

S*(c)  -  (1,2):  ^(uo-maxi..-1,--.?  0+0-2-,  T(-  (2-.)'1'  ..  )  , 

-1  y 

where  a  =  r(p+r)  and  T(y)  =  /  :  (x)dx,  y  (  IR  . 

The  last  expression  follows  from  Lemma  3  in  Miescke  (1979b).  5 i rice  I 

is  an  increasing  function  with  T( 0)  (2..)'  +  ,  the  procedure  will  never  ar¬ 
rive  at  Stage  2  if  c  ■  s •  n ~  .  But  on  the  other  hand,  let  c  •  .  .  ;y.  heUr 

an  ii  j  with  .  .  j ,  0^ ,  li  ,j  1=  (1 ,2  : ,  wi  i  1  he  selected  by  S* .  But  now  if  |  ■  ^ 

or  £  ■  "0  then  S*(/.)  -  fl,2t  if  and  only  if  y  }  "  I  • 

Moreover,  if  ■  i ,  ,  ■  o  there  is  ar.  area  in  the  neighborhood  of  (  ■  , 
where  also  S*(c.)  -  {1,2  occurs.  Thus  within  IK/\-' -m  .  •  .  is 

of  the  type  of  Gupta's  (1965)  maximum  means  procedure. 


If  now  more  generally  a  decision  0  is  also  admitted  at  .Stage  then 
the  optimal  procedure  (S*,d*)  is  of  similar  form  hut  is  no  longer  rep,  o- 
sentable  in  such  a  concise  manner.  typically,  the  area  where  at  Siaoe  1 


In 

both  populations  are  selected  will  be  larger. 

Finally,  let  us  mention  that  one  gets  analogous  results  if  o trier  loss 
functions  are  admitted.  It  is  thinkable  that  especially  ■  (.')  *  .  ,) 

if  A  >  (<)0,  ; ,  ,  .'2  '  leads  to  a  procedure  which  is  closer  to  that  one 

given  at  the  beginning  of  this  section.  But,  unfortunately,  its  representa¬ 
tion  is  more  complicated  such  that  this  question  could  be  studied  only  num¬ 
erical  ly. 
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i=l,. ..,k,  where  :•  (  IR  is  a  fixed  given  constant.  The  goal  is  to  guarantee 


at  least  a  probability  P*  of  making  a  correct  decision  if 


,  i  1 . k  . 


and  to  maximize  the  probability  of  finding  .1  population  better  than  other¬ 
wise. 

Two-stage  procedures  of  tne  following  type  will  be  studied:  At  Stage  I, 
based  on  samples  X^,.,.,X^,  all  populations  are  screened  out  which  appear  to  be 
no  better  than  >■  If  none  (exactly  one)  is  left  t.he  procedure  stops  and  de¬ 
cides  that  none  (this  one)  is  better  than  .  If  more  than  one,  n.  with  i  cs, 
survives  then  one  proceeds  to  Stage  Here  additional  samples  Y.,  1  ss,  are 
drawn  and  final  decision  is  made  based  on  Y.  or  (X.,Y.),  i  € s. 

A  natural  class  of  two-stage  procedures  is  proposed  which  can  be  complete¬ 
ly  described  and  studied  in  terms  of  Neyman- Pears on  testing  theory,  where  the 
un symmetry  of  tests,  however,  can  be  overcome  to  a  considerable  extent.  As  a 
typical  result  it  is  shown  that  optimality  of  tests  carries  over  to  optimality 
of  two-stage  procedures,  finally,  under  normality,  comparisons  are  made  in 
case  of  k-  ^  with  certain  Bayesian  procedures. 
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